Presentation 


Back  to  Agenda 


The  Morphware  Stable  Interface: 

A  Software  Framework  for  Polymorphous  Computing  Architectures 


Daniel  P.  Campbell 
Mark  A.  Richards 

Georgia  Institute  of  Technology 
Atlanta,  GA 

770-528-7541  /  dan.cannpbell@gtri.gatech.edu 


Dennis  M.  Cottel 
Randall  R.  Judd 

SPAWAR  Systems  Center 
San  Diego,  CA 


Kenneth  M. 
Mackenzie 

Reservoir  Labs,  Inc. 
New  York,  NY 


Abstract 

Polymorphous  Computing  Architectures  (PCAs)  are  computing 
devices  that  are  capable  of  significant,  rapid  reconfiguration 
directed  by  software.  Composed  of  several  groups  of  computing 
elements,  PCA  devices  can  be  configured  to  achieve  high 
performance  on  a  wide  variety  of  problem  types  and  processing 
demands.  We  describe  an  emerging  concept  called  the 
Morphware  Stable  Interface  (MSI),  a  portable,  scalable  software 
development  framework  to  harness  the  power  and  complexity  of 
PCAs,  while  allowing  productive  software  development,  and 
rapid  adoption  of  PCA  devices. 

1.  INTRODUCTION 

The  Polymorphous  Computing  Architectures  (PCA)  program  is  a 
Defense  Advanced  Research  Projects  Agency  (DARPA)  effort  to 
develop  new  embedded  computing  platforms  with  very  strong, 
rapid  in-mission  reconfigurability.  Target  applications  range 
from  military  platforms  that  must  adapt  to  rapidly  changing 
mission  parameters,  to  embedded  network  controllers,  whose 
optimal  configuration  of  hardware  resources  will  change  in 
response  to  the  traffic  and  environmental  conditions  it  faces. 

The  PCA  program  “core  projects”  working  to  develop 
microprocessors  that  implement  polymorphous  capabilities 
include  Smart  Memories  [1],  Raw  [2],  M3T  [3],  TRIPS  [4],  and 
MONARCH  [5].  The  chips  under  development  in  these  projects 
have  several  characteristics  in  common.  These  are  typically  tiled 
structures  composed  from  replicated,  fully  capable  computing 
cores,  reconfigurable  memory  and  cache  elements,  and  a  rich  set 
of  reconfigurable  data  paths,  network  interfaces,  and  I/O  paths. 
Each  can  operate  in  streaming  or  threaded  modes.  Each  has 
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mechanisms  for  aggregating  individual  processor  tiles  into  larger 
compound  processor  units.  They  differ  in  their  approach  for 
aggregating  processors  and  in  their  emphasis  on  processor, 
memory,  or  communication  design.  Figure  1  illustrates  a  generic 
PCA  microarchitecture. 

The  increased  capability  of  PCA  systems  comes  at  the  expense  of 
increased  software  complexity.  Applications  written  with 
knowledge  about  the  platform  embedded  into  the  structure  of  the 
application  can  make  use  of  the  reconfigurability  of  such 
resources,  but  suffer  from  a  lack  of  scalability  and  portability. 
Applications  written  with  no  such  knowledge  are  completely 
dependent  on  build  and  run-time  systems  to  utilize  the  capabilities 
of  reconfigurable  systems. 

An  important  goal  of  the  PCA  program  is  therefore  to  create  an 
application  development  framework,  called  the  Morphware  Stable 
Interface  (MSI),  that  will  exploit  the  capabilities  of  PCA 
hardware  while  retaining  as  much  portability  and  performance  as 
possible.  The  MSI  should: 

•  Support  dynamic  hardware  reorganization  and  optimization 

•  Obtain  nearly  optimal  performance 

•  Abstract  configurable  computing  elements 

•  Abstract  hardware  reconfiguration 

•  Mitigate  development  and  runtime  complexity 

•  Eeverage  existing  technologies. 

This  paper  presents  the  early  results  of  the  MSI  design  effort. 

2.  THE  MORPHWARE  FORUM 

To  facilitate  the  development  of  the  cross-project  consensus  and 
design  effort  needed  to  realize  the  MSI,  the  PCA  program  has 
formed  the  Morphware  Forum,  an  informal  consortium  of  the 
PCA  contractors  and  other  selected  participants.  Organized  and 
led  by  Georgia  Tech  under  DARPA  sponsorship,  the  Forum  meets 
quarterly,  with  other  interim  activities  as  required,  to  develop  and 
debate  proposals  for  the  MSI.  The  Forum  will  ultimately  develop 
a  set  of  publicly  available  standards  documents  that  define  the 
architecture  and  details  of  the  MSI. 

Georgia  Tech  maintains  a  Morphware  Forum  web  site  at 
http://www.morphware.org  that  provides  a  vehicle  for  meeting 
planning,  collaborative  exchange,  and  public  information  about 
the  MSI  effort.  At  this  writing,  the  public  portion  of  the  site  is 
limited  to  introductory  papers  and  briefings  on  the  MSI  effort  and 
the  PCA  systems,  and  links  to  program  participants’  web  sites. 
MSI  standards  documents  will  be  available  at  this  site  as  the 
Forum  approves  them. 


Figure  1.  Generic  PCA  chip  micro-architecture. 
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3.  MORPHWARE  STABLE  INTERFACE 

The  MSI  is  a  framework  eonsisting  of  several  major  elements. 
Some  of  the  most  signifieant  aspeets  of  the  MSI  are  deseribed. 

3.1  The  Streaming  Virtual  Machine 

Several  of  the  projeets  have  developed  speeialized  languages  to 
exploit  the  differenees  between  PCAs  and  traditional  CPUs.  The 
highly  parallelized,  low  unit-to-unit  lateney  of  PCA  deviees 
exposes  a  need  for  eompilers  to  more  fully  understand  data 
dependeneies  and  eontrol  flow  within  an  applieation  than  is 
possible  with  languages  sueh  as  C.  In  order  to  expose  these 
elements  more  elearly  to  eompilers,  the  projeets  have  developed 
variants  of  C  and  C++  that  introduee  and  enforee  data-stream,  and 
operational  kernel  eonstruets.  These  language  express  programs 
as  direeted  data  flow  graphs  eonneeting  various  eomputational 
kernels.  The  restrietions  on  data  and  eontrol  flow  allow  eompilers 
to  map  algorithms  very  effeetively  to  PCA-style  deviees,  while 
simplifying  souree  eode  for  a  wide  range  of  applieations.  These 
languages  express  a  fundamentally  different  underlying  virtual 
maehine  than  languages  sueh  as  C,  and  form  an  important  aspeet 
of  the  MSI. 

3.2  MSI  Portability  Layers 

From  its  ineeption,  the  PCA  program  has  advoeated  dual 
portability  layers  for  the  MSI.  The  MSI  provides  portability  via 
the  upper  “Stable  API”  (SAPI)  and  the  lower  “Stable  Arehiteeture 
Abstraetion  Layer”  (SAAL).  Figure  2  illustrates  the  eoneept.  It 
reduees  development  effort  for  applieation  build  systems  by 
providing  a  eommon  middle  layer;  allows  addition  of  new  top 
level  applieation  approaehes  without  breaking  existing  build 
systems;  provides  a  stable,  platform-independent  target  for  top 
level  build  tools;  allows  dynamie  eompilation,  and  inereases 
speeialization  opportunities  in  the  builder/middleware 
marketplaee. 

The  SAPI  eonsists  of  multiple  platform-independent,  high-level 
languages  and  metadata  representations.  These  languages  will  be 
extensions  to  existing  languages  sueh  as  C  or  C++  that  express 
eaeh  of  the  virtual  maehine  abstraetions  present  in  the  SAAL. 
The  SAAL  will  be  a  portable  C-based  representation  of  a  virtual 
maehine  with  both  streaming  and  threaded  modes.  It  will  be  both 
platform-  and  SAPI  language-independent.  Finally,  it  will  be 
lower  level  than  SAPI  languages,  and  able  to  support  dynamie 
eompilation.  The  Morphware  Forum  is  eurrently  aetively 
developing  detailed  proposals  for  the  SAAL  VM. 


Application  Software 

Algorithm  descriptions,  Constraints, 
Performance  Requirements 

1  ^ 

Stable  APIs  (SAPI) 

Collection  of  portable  APIs 

, _ A _ ,  ' - ’ 

Stable  Architecture 
Abstraction  Layer  (SAAL) 

/J  Portable,  low-level  hardware 

abstractions 

Binary  Executables 

Hardware 

3.3  Component-Based  Framework 

The  ability  of  applieations  to  intelligently  reeonfigure  host 
platforms  is  a  eritieal  element  of  PCA  software.  In  order  to 
faeilitate  morphing,  a  eomponent-based  software  framework  has 
been  proposed.  Components  provide  natural  and  intuitive 
boundaries  for  run-time  reeonfiguration  of  hardware.  Software 
eomponents  will  be  built  for  varying  hardware  eonfigurations,  and 
the  appropriate  version  may  be  loaded  and  exeeuted  on  the  fly  as 
portions  of  the  host  platform  are  reeonfigured  by  the  PCA  system 
or  applieation  software. 

3.4  Metadata 

PCA  systems  must  to  be  aware  of  and  reaetive  to  applieation, 
resouree,  and  eonstraint  goals  and  requirements.  Examples 
inelude  SWEPT  (size,  weight,  energy,  performanee,  and  time) 
eonstraints,  quality  of  serviee  requirements  (e.g.  lateney, 
throughput),  morph  poliey,  seeurity  requirements,  and  so  forth. 
Metadata  provides  a  means  for  the  MSI  to  represent  the 
information  needed  to  implement  this  eapability. 

Metadata  are  non-funetional  deseriptions  of  requirements, 
eonstraints,  desired  resouree  management  poliey,  hardware 
eonfiguration  options  or  any  other  information  that  expresses 
information  about  system  operation  independent  of  the 
applieation  funetional  deseription.  Eaeh  of  the  uses  of  metadata 
eonstitutes  a  unique  eontext  that  must  be  standardized  and 
deseribed.  A  standard  method  for  deseribing  metadata  eontexts 
has  been  proposed  and  is  eurrently  under  eonsideration  by  the 
Morphware  Forum.  Speeifieations  for  several  metadata  eontexts 
have  been  proposed,  and  standard  appropriate  storage,  retrieval, 
and  query  meehanisms  for  the  metadata  are  eurrently  being 
designed. 
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Figure  2.  SAPI  and  SAAL  MSI  portability  layers. 
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DARF^  Polymorphous  Computing  Architectures 


■  DARPA  effort  for  high  performance 
embedded  platforms  with  strong,  rapid, 
reactive  in-mission  configurability 

□  Support  dynamic  and  multi-mission  requirements 

□  Support  collaborative,  information-centric  missions 

■  PCA  will  develop  processing  architectures 
that  “morph” 

□  Hardware  and  software  resources  reconfigure  to  balance 
resource  requirements  and  availability 

•  at  multiple  levels:  micro-architecture,  network,  system 

•  at  multiple  time  scales:  in-mission,  between-mission 
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Generic  PCA  Microarchitecture 
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Reconfigurable  processor 
Reconfigurable  memory 
Reconfigurable  cache 


-  Fixed  communication 

.  Configurable  communication 

■  ■  ■  ■  High  Speed  off-chip  I/O 
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Generic  PCA  Microarchitecture 


■Tiled  structure 

□Fully  capable  computing  cores 

□Configurable  memory  and  cache 

□Configurable  data  paths,  network  interfaces,  and  I/O 

□Streaming  and  Threaded  modes 

□Methods  to  aggregate  tiles  into  larger  processing  units 

■Core  projects  differ  in 

□aggregation  mechanisms 

□relative  emphasis  on  processor,  memory,  or  comm  design 

■  Performance  on  the  order  of  (per  chip) 

□4  -  64  GFLOPS  /  4  -  16  GOPS 
□25  -  32  GB/s  off-chip  I/O 
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Software  and  PCA 


■  Increased  hardware  flexibility  and  complexity 
brings  increased  software  complexity 

□  If  we  build  target  platform  reconfigurability  and  performance 
info  into  the  appiication,  we  iose  scaiabiiity  and  portabiiity 

□  If  we  don’t,  the  build  and  run-time  systems  wiii  be  entireiy 
responsibie  for  ieveraging  the  piatform  capabiiity,  and  we 
stiii  iose  fine-grain  morphabiiity 

□  Appiications  must  be  reactive  to  feedback  from  the  hardware 

•  resource  collisions,  SWEPT,  faults 

■  Solution:  the  Morphware  Stabie  Interface  (MSI) 
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The  Morphware  Stable  Interface  (MSI) 

■  Application  Development  Framework  for  PCAs 

■  Comprised  of  a  software  architecture  and  a  suite  of 
open  standard  APIs 

■  Goals 

□  Dynamically  optimize  PCA  resources  for  application  functionality, 
service  requirements,  and  constraints 

□  Obtain  nearly  optimal  performance  from  PCA  hardware 

□  Be  highly  reactive  to  PCA  hardware  and  user  inputs 

□  Manage  PCA  software  complexity 

□  Leverage  existing  and  already-developing  technologies 

■  Cross-project  effort,  developed  in  parailel  with  the 
hardware 
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The  Morphware  Forum 
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■  Informal  consortium  of  the  PCA  contractors  and 
other  selected  participants 

■  Organized  and  led  by  the  Georgia  Tech/SPAWAR 
team 

■  Meets  quarterly 

□  interim  meetings  and  activities  as  required 

■  Propose,  debate,  develop,  test,  validate, 
document,  and  demonstrate  standards  that  define 
the  MSI 
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The  Morphware  Forum 


■Applied  Photonics 
■Georgia  Institute  of  Technology 
■George  Mason  University 
■IBM 

■Lockheed  Martin  Company 
■Massachusetts  Institute  of  Technology 
■MIT/Lincoln  Laboratory 
■Mercury  Computing  Systems 
■Mississippi  State  University 
■MPI  Software  Technology,  Inc. 


■Northrop  Grumman 

■  Reservoir  Labs,  Inc. 

■  Raytheon 
■SPAWAR 

■South  West  Research 
■Stanford  University 

■  University  of  Texas  -  Austin 

■  University  of  lilinois 

■  University  of  Pennsylvania 

■  University  of  Southern  Caiifornia 
■Vanderbilt  University 
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Dual  Portability  Layers 


stable  API  (SAPI)  and  Stable 
Architecture  Abstraction 
Layer  (SAAL)  provide  duai 
portabiiity  iayers 

Appiication  SW  describes 
functionaiity,  constraints,  and 
performance  requirements 

SAPI  is  PCA-aware  coiiection 
of  standardized  ianguage  and 
service  APIs 

SAAL  is  PCA-aware 
abstracted  iow-ievei  machine 
representations 
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Why  SAAL? 


■  Traditional  languages  based  on  a  machine  model 
increasingly  incorrect 

□  Single  program  counter 

□  One  operation  at  a  time 

□  Data  universally  local 

■  All  modern  high  performance  computing  systems  battling 
this  issue 

■  In  order  to  exploit  new  hardware,  core  teams  developed 
new  languages  not  based  on  old  model 

■  New  languages  based  on  similar  models 

■  Formalize  the  models  to  make  it  explicit 
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Stream  Languages 


■  Compute-intensive  portions  of  many  appiications  have 
characteristics  of  stream  operations 

□  fixed  data  flow  graph 

□  large,  possibly  infinite,  data  stream 

□  functional  kernels  not  data-dependent 

□  functional  kernels  independent  of  one  another 


■  Representations  that  enforce  these  characteristics  ideaily 
suit  PCA  architectures,  aid  compiier  in 

□  Optimization 

□  Scheduling 

□  Resource  allocation 

□  Data  Locality 
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Morphware  Languages 
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SAAL  Instantiation 


■  Traditional  languages  have  an  implicit  SAAL  layer 

■  MSI  has  an  explicit  SAAL  layer,  a  portable  API  that  exposes 
the  virtual  resources  typical  of  PCA  systems 

□  Sacrifices  some  tooi  chain  fiexibiiity  for  simpier,  more  defined,  more 
anaiyzabie  buiid  chain 

□  Factors  depioyment  of  new  ianguages  and  hardware 

□  Aiiows  expiicit  consideration  of  modei  of  computer 

□  Formaiizes  and  augments  existing  modei 

■  Creates  a  two-stage  compile  process 

■  Example  constructs:  kernel,  stream,  processor,  etc 
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Stable  Architecture 

Virtuai  Machine  API 
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Stream  VM 

Thread  VM 
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Low  Level  Compilers 
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Metadata  in  Morphware 


■  PCA  Hardware  is  complex  and  changing 

■  PCA  Missions  are  complex  and  changing 

■  Large  amount  of  configuration,  constraints, 
requirements,  etc.  information  in  addition  to 
functional  requirement 

■  Extracting  and  encapsulating  this  information 

□  Increases  portability,  scaiabiiity 

□  Faciiitates  Reconfiguration,  Repurposing,  Redepioyment 

□  Is  an  important  goai  of  most  modern  software  systems 
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Metadata  System 


■  Metadata  needed  throughout  the  PCA  system 

□  Several  contexts 

□  Consistent  method  of  representation  &  query  preferred 

□  Needed  to  enabie  processor  and  compiier  deveiopers  to  progress 

■  Current  system  stores  metadata  as  XML 

□  Metadata  is  expressed  as  relational,  hierarchical  object  oriented  structure 

□  Instantiated  as  XML 

□  Contexts  are  defined  by  a  Schema  and  Documentation 

□  Accommodates  procedurai  or  static  representation  queries 

□  Accessible  to  wide  range  of  API’s,  tools,  etc. 
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■  Needed  by  HLC  to  improve  VM  output 

□  Helps  allow  coarse  grain  partitioning  of  appiications  into 
appropriate  sized  pieces 

■  Nearly  complete,  minor  fixes  remain 

■  Describes  target  platform  using  common 
dictionary  of  virtual  resources  and  attributes 

□  Processors:  type,  frequency,  max-IPC,  iatency... 

□  Memories:  type,  size,  cache-iinesize,  associativity... 

□  Net-Links:  senders,  receivers,  iatency,  bandwidth 


I  [^<s®(ssiir@lhi 

II  OmsMjiQ® 


Georgia 

Tech 


Di^A9  Dynamic  Configuration 


■  Model  so  far  good  for  flexible  resources,  goals  & 
constraints 

□  Two  level  compile 

□  structured  VM  code 

□  Well  defined  metadata 

□  Good  compilers 

■  Dynamic  resources,  goals,  &  constraints  much 
harder  problem 

□  Builds  have  (nearly)  infinite  time  to  anaiyze  &  search  the 
soiution  space,  run-time  changes  must  happen  quickiy 

□  Static,  configurabie  buiid  parameters  a  hard,  but  tenabie  task 

□  Support  for  dynamic  criteria  expiodes  the  soiution  space 
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Alternate  Monoliths 


■  Build  with  several  parameters 

□  Traverse  build  chain  with  a  defined  set  of  constraints,  goals, 
resources  expected 

□  Deploy  binaries  for  each  set 

□  Select  the  best-fit  binary  at  run-time 

■  Benefits 

□  Build  chain  sooner 

□  Easier  problem,  faster  builds 

□  Known,  testable  states 

□  Better  optimization  for  known  states 

■  Problems 

□  Problems  with  unexpected  hardware  states 

□  Not  as  flexible  as  the  hardware 

□  Only  optimal  for  expected  states 
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Di^A9  Component-Based  Approach 


■  Flexibility  &  resilience  gained  by  partitioning  physical  resources 

■  Configure  each  partition  independently 

■  Build  binaries  for  each  partition  in  various  states 

■  Benefits: 

□  Smaller  problem  makes  flexible  build  criteria  more  feasible 

□  Hierarchical  approach  factors  the  problem  of  resource  management 

□  Able  to  match  run-time  needs  more  closely 

□  Able  to  achieve  top  performance  in  more  situations 

□  More  easily  respond  to  hardware  failures  &  changes 

■  Problems 

□  Requires  a  more  robust  run-time  system  to  fully  exploit 

□  Many  states  possible  -  complicates  testing 

□  Framework  bloat 
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Di^A9  Morphware  Forum  Steps 


■  Priority:  End-to-End  framework  that  allows  an  application 
that  can  reconfigure  it’s  platform 

■  Immediate  priorities: 

□  Finish  TVM 

□  Finish  HWMD 

□  Define  HLC  /  LLC  Interaction 

□  Determine  run-time  services 

•  Load,  unload,  configure,  measure,  etc. 

□  Consider  component-based  approaches 

■  Continue  regular  activities 

□  Quarterly  meetings,  interims,  draft  documents,  etc 
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■  The  Morphware  Forum 
web  site  provides  some 
pubiic  information 

□  Selected  public  papers  & 
briefings 

□  Links  to  PCA  project 
sites  and  related  links 

□  Link  to  DARPA  PCA 

□  This  paper  and 
presentation,  soon 

■  In  the  future,  it  will 
provide  one-stop 
public  dissemination 
of  MSI  documents 
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